Skip to content

Support multiple cuDecomp handles.#139

Merged
romerojosh merged 2 commits into
mainfrom
multi-handle-support
Jun 16, 2026
Merged

Support multiple cuDecomp handles.#139
romerojosh merged 2 commits into
mainfrom
multi-handle-support

Conversation

@romerojosh

Copy link
Copy Markdown
Collaborator

Currently, cuDecomp prohibits users from having more than one active library handle. Because of this, users have always been restricted to running cuDecomp operations on the single set of processes associated with the MPI communicator passed to cudecompInit for then entire job duration.

In order in increase the flexibility of the library, this PR relaxes this restriction and enables users to create multiple cuDecomp library handles, each potentially associated with a different MPI subcommunicator.

One restriction to this feature is that handles using NVSHMEM backends must be initialized with congruent sets of ranks. This is to work with existing restrictions on NVSHMEM finalization/reinitialization and a lack of teams-based nvshmem_malloc.

Signed-off-by: romerojosh <joshr@nvidia.com>
@romerojosh

Copy link
Copy Markdown
Collaborator Author

/build

@github-actions

Copy link
Copy Markdown

🚀 Build workflow triggered! View run

@github-actions

Copy link
Copy Markdown

✅ Build workflow passed! View run

Signed-off-by: Josh Romero <joshr@nvidia.com>
@romerojosh romerojosh merged commit fa0f294 into main Jun 16, 2026
4 checks passed
@romerojosh romerojosh deleted the multi-handle-support branch June 23, 2026 17:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant